GROWTH OF POSITIVE WORDS AND LOWER BOUNDS OF THE 
GROWTH RATE FOR THOMPSON'S GROUPS F(p) 
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Abstract. Let F(p), p > 2 be the family of generalized Thompson's groups. Here F(2) 
is the famous Richard Thompson's group usually denoted by F. We find the growth 
rate of the monoid of positive words in F(p) and show that it does not exceed p + 1/2. 
Also we describe new normal forms for elements of F(p) and, using these forms, we find 
a lower bound for the growth rate of F(p) in its natural generators. This lower bound 
asymptotically equals (p — 1/2) log 2 e + 1/2 for large values of p. 



Introduction 

The family of generalized Thompson's groups F(p) was introduced by K. S. Brown in 
Additional facts about these groups can be found in [SJ 122] ■ The case p = 2 corresponds 
to the famous Richard Thompson's group F. See the survey [5| for details about this 
group. 

The groups F(p) have many common features. All of them are embeddable into each 
other None of them has free non-abelian subgroups. None of these groups satisfy 
any nontrivial group law. The derived subgroups of each of the F(p) is simple (infinitely 
generated). Every proper homomorphic image of F(p) is abelian (so these groups are not 
resi dually finite). Each F(p) is finitely presented and has quadratic Dehn function [To] . 

Each of these groups has a faithful representation by piecewise linear functions. The word 
problem has an easy solution in each of these groups. Also all these groups are diagram 
groups in the sense of Namely, F(p) is a diagram group over a very simple semigroup 
presentation (x \ x = x p ). It follows then from ^3 Section 15] that F(jp) has solvable 
conjugacy problem. Each group F(p) satisfies homological finiteness condition T^- All 
integer homology groups H n (F(p), Z) are free abelian of finite rank and the Poincare series 
are rational [TTi] . 

However, there is some difference between the groups of this family. Brin 0\ described 
the group AutF for F = F(2). Some information about automorphisms of F(p), where 
p > 2, can be found in where it is shown that already for p = 3 there are "wild" 
automorphisms of F(p). 

The goal of this article is to obtain analogs of some results for the group F. The first 
author found the growth function of the monoid of positive elements of F. This function 



Date: February 2, 2008. 

2000 Mathematics Subject Classification. Primary 20F32; Secondary 05C25. 
Key words and phrases. Generalized Thompson's groups; growth. 

Both authors thank the Centre de Recerca Matematica for the hospitality and support. The research 
of the second author was also partially supported by the RFFI grant 05-01-00895. 

1 



is rational, namely, it equals 

1-x 2 
1 — 2x — x 2 + x 3 

Notice that the elements Xq, Xi, . . . , generate a free submonoid of rank p in F(p). 
Thus the growth rate of positive elements in F(p) is at least p. In this paper we show that 
for any p, the exact value of the growth rate of positive elements is only slightly higher 
than p — it never exceeds p + 1/2. 

Guba and Sapir ^H] found two new normal forms for elements of F. One of them is a 
normal form in the infinite set of generators. This normal form is locally testable (unlike 
the standard normal form). It has the same feature as the normal form in the free group: 
a word is in a normal form if and only if all its subwords of length 2 are in the normal form. 
In this paper, we find such a form for every F(p). Another normal form constructed in 
[TH] for F allows one to construct a regular set of normal forms in F. We find an analogous 
construction for each F(p). 

Using the above regular normal form, the second author proved in that the growth 
rate of the group F in generators Xo, x\ is at least (3 + v / 5)/2. Notice that neither the 
growth function, nor the growth rate for F is known at the present. In this paper we find 
a lower bound of the growth rate for each of the groups F(p), where the generating set 
consists of xo, xi, . . . , x v -\. We show that the lower bound is a root of a certain algebraic 
equation and find the asymptotic behaviour of this root. For large values of p, this is 
(p - 1/2) log 2 e + 1/2, where log 2 e = 1.442695 .... 

The plan of the paper is as follows. In Section ^ we recall the definition of the family 
F(p) of generalized Thompson's groups and some basic facts about growth functions and 
growth rates. This Section also contains a description of (positive) elements in F(p) in 
terms of rooted p-trees. 

In Section El we describe Fordham's method to calculate the word length in F(p). We 
restrict ourselves to the case of positive words only (the description for this case is much 
simpler). Recall that for the case p = 2, a fast algoritm to find the word length metric 
was described in jTH [TJ] . This algorithm is very effective but it has quite a complicated 
description. A simplification of the method due to Belk and Brown can be found in pQ. 
One of the easiest algorithms to find the word length in F (the so-called Length Formula) 
is contained in ^Sl Section 5]. Notice that for p > 2, none of the simplified versions exists 
so we use Fordham's approach from [TH] . 

In Section El using Fordham's method, we find equations for generating functions de- 
scribing the growth of F + (p). We solve these equations in Section 0] and show that the 
generating function for positive words in F(p) is irrational provided p > 2 (unlike the case 
p = 2). Then we find the growth rate of positive words in F(p) as a root of an algebraic 
equation. We prove that this growth rate never exceeds p + 1/2 approaching this value 
as p approaches infinity. Thus the set F + (p) of all positive words is not much higher than 
the free submonoid generated by xq, x\, . . . , x v -\. 

Section El describes two new normal forms of elements in F(p). The first of these forms is 
locally testable (one needs to test only subwords of length 2, similar to a free group). The 
second of the normal forms leads to a regular language that represents each element of F(p) 
exactly once. Based on that regular language, we construct the corresponding automaton 
and find a lower bound for the growth rate of F(p) in SectionlHl This lower bound is given 



as a root of an algebraic equation. We also describe its asymptotic behaviour showing 
that it approaches (p — 1/2) log 2 e + 1/2 for large values of p. 

1. Preliminaries 

The family of generalized Thompson's group can be defined as follows. The group F(p) 
is the group of all piecewise linear self homeomorphisms of the unit interval [0, 1] that are 
orientation preserving (that is, send to zero and 1 to 1) with all slopes integer powers 
of p and such that their singularities (breakpoints of the derivative) belong to Z[ ^ ]. The 
group F(p) admits a presentation given by 

(1) (xi (i>0) | XjXi = XiXj+p^ (i < j) ). 

This presentation is infinite, but a close examination shows that the group is actually 
finitely generated, since xq, X\, . . . , x p -\ are sufficient to generate it. In fact, the group 
is finitely presented. The finite presentation is awkward and it is not used much. The 
symmetric and simple nature of the infinite presentation makes it much more adequate 
for almost all purposes. 

One such example where the infinite presentation is particularly appropriate is in the 
construction of the normal form. A word given in the generators Xj and their inverses, 
can have its generators moved around according to the relators, and the result is the 
following well-known statement: 

Theorem 1.1. An element in F(p) always admits an expression of the form 

T* ■ T* - • • • t* ■ r f • • • T* T* 

% m- Xj j n • V j 2 jl ' 

where 

h < k < ■ ■ ■ < V, h < h < • ■ ■ < 3n- 

In general, this expression is not unique, but for every element there is a unique word of 
this type which satisfies certain technical condition (see |Hj for details). This unique word 
is called the standard normal form for the element of F(p). 

Observe that the infinite presentation for F(p) is actually a monoid presentation. Hence 
F(p) admits a submonoid, the submonoid F + (p) given by the same presentation, whose 
elements are called positive words. Theorem 11.11 shows that F(p) is the group of right 
fractions of this monoid. 

An element of F(p) can be represented by two subdivisions of the interval [0, 1], namely, 
the subdivision into intervals which get mapped linearly to each other. A subdivision of 
this type, where the dividing points are all in Z[ - ], can always be obtained by subsequent 
subdivisions of the interval into p equal pieces. Hence, a subdivision of the interval is 
equivalent to a rooted tree where each vertex has valence p + 1 except the root, which has 
valence p (or 1 in case when the tree consists of the root only), and the leaves, which have 
valence 1. A node (except the root and the leaves) is pictured to have one edge going up 
and p edges going down to its p children. These trees will be called rooted p-trees. An 
element of F(p) is then represented by a pair of rooted p-trees called the source tree and 
the target tree. This representation has been extensively studied in the case p = 2. Note 
that positive words can be represented by a single p-tree, because the other tree is always 
the same: the tree which consists of all right carets. 
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A piece of these p-trees consisting of a node and its p edges going down to its children 
is called a caret. Carets are the building blocks of the trees and they give rise to the 
algorithm for finding the word metric in F(p), see Section El 

As stated in the introduction, the exact growth function for the groups F(p) is not known. 
In this paper we will give lower bounds for growth rates of these groups, computing lower 
bounds for the number of elements in each length. 

To be precise, given a finitely generated group G with finite generating set X, denote its 
sphere of radius n by 

S(n) = {geG\i(g) = n}, 

where £(g) is the length of g G G in the set of generators X. We also have the ball of 
radius n 



B(n)= Qs(fc). 

If 7„ = #B(n), the series 

oo 

r(*) = j>»* B 



n=0 



is called the (general) growth function for G with respect to X, and the number 

7 = lim 7„ 

n— >oo 

is the growth rate of G with respect to X. The limit always exists due to the sub multi- 
plicative property of j n , that is, ■y m +n < 7m In for all m, n > 0. Also, the spherical growth 
function is given by a n = #S(n) and 



n=0 



which has the same growth rate as the general growth function (for all infinite groups). 
For details about growth functions, see, for instance, [Tlj . 

If P C G is a subset of a group, not necessarily a subgroup, we can define the growth 
functions of the set P by the same formulas as above but where the coefficients are actually 
the cardinals of the sets P H B(n) or P D S(n). The goal for one of the next sections is to 
compute the growth series of the subset F + (p) in F(p). In order to do that, we need to 
describe the algorithm for calculating the word metric in F(p). 



2. Positive words in Thompson's groups F(p) and Fordham's method 

In 1995, S. Blake Fordham [TJ] constructed an algorithm which, for any given element in 
F = F(2), finds its distance to the identity in the word metric given by generators Xq, 
x\. This algorithm consists in defining different types of carets, then having each caret of 
the source tree paired to its corresponding caret in the target tree, and assigning a weight 
to each type of pairs of carets. A table is given for all possible pairs of types, with the 
assignment of the weight. The sum of all the weights of all the pairs is the exact distance 
from the element to the identity. In a set of unpublished notes jTHj, Fordham extends his 
method to the groups F(p). This method will be the starting block of the computation. 



The method used to compute this growth will be an extension to F(p) of the method 
developed in [2j for the case of F — F(2). Consider a positive element of F(p). As we 
know, the element can be represented by a rooted p-tree. We are going to define different 
types of carets and their weights, following Fordham [T3] . 

A caret will be called left or right if it is situated in the leftmost edge of the tree or in 
the rightmost edge, and middle or interior if it is situated in the middle, i.e. if it is not 
right or left. For instance, a caret is left if it represents a subinterval of [0, 1] which has 
left endpoint equal to zero. Middle carets will be subdivided into p — 1 types, denoted 
by Ai 1 , A4 2 , . . . , M. p ~ l according to which caret they are children of, and its position as 
child. 

The children of a caret are subdivided in two types, the predecessors and the successors. 
This subdivision will give a total order to the set of carets, with a caret being always after 
its predecessor children and before its successors. The definitions of the caret types are 
as follows: 

• The root caret is special. Its children are: 

— Its left child is a left caret and it is the only predecessor. 

— Its middle children are successors, and have types M 1 , A4 2 , . . . , Ai p ~ 2 , in 
order-preserving way. 

— Its right child is obviously a successor and a right caret. 

• A left caret has the following children: 

— Its only predecessor is the left child, a left caret. 

— All the other children are successors, all middle carets, and of types A4 1 , M 2 , 
. . . , Ai p ~ x , in order. 

• A right caret has the following children: 

— One single predecessor of type Ai p ~ l . 

— It has p — 1 successors, which in order are of types A4 1 , A4 2 , . . . , Ai p ~ 2 and 
the last one of type 1Z. 

• A caret of type M. 1 (1 < % < p — 1) has the following children: 

— The first p — i children are predecessors, and their types are Ai l , . . . , M. p ~ l . 

— The other i children are successors, and they are of types A4 1 , A4 2 , . . . , M 1 . 

For the purposes of computing the length of an element, these caret types are subdivided 
in further types depending on the existence of predecessor and successor types. This 
classification is actually more complicated in Fordham's paper but we do not need the 
total strength of the method since we are dealing only with positive words. We will 
indicate also which is the weight of each caret for the purposes of the computation of the 
length of a positive word. 

The caret types are as follows: 

• The root, which has always weight zero. 

• Left carets, which have always weight one. 

• Carets of type 1Z are right carets whose all successors are right carets, i.e., it 
has no middle successors. Its only successors hang from its rightmost leaf. These 
carets carry weight zero. 

• Carets of type TZm are right carets which are not 1Z , that is, which have middle 
successors. Observe that the middle successors do not have to be immediate 



successors, they can be successors of successors. Carets of type 7?.^ have weight 
two. 

• Carets of type M. l are middle carets which do not have any successor children. 
They carry weight one. 

• Carets of type M. l M are middle carets which have at least a successor child. These 
carets have weight three. 

Observe that the index on the middle carets is only necessary to identify its successors, 
but it has no role in the weight assignment beyond that one. 

Now, the main theorem giving the length is as follows: 

Theorem 2.1. (S. B. Fordham) J3| Given a positive word in F(p) represented by a rooted 
p-tree, the distance from this element to the identity (in the word metric for F(p) with 
generators Xq, X\, . . . , is equal to the total sum of the weights of its carets. 

3. Generating functions for the growth of positive words 

Once the theorem for the length has been established, now the computation of the growth 
function is reduced to a combinatorial problem, namely, finding how many trees have a 
given weight, according to the rules above. The method for finding the number of trees 
with a given weight is to split the trees in several ones in such a way that recurrences can 
be found. The reader can see details about generating functions in and can see this 
method used already in [7j. 

We will make use of several sequences: 

• The sequence s n = #(F + (p) n S(n)). This is the number of trees which have 
weight n. 

• The sequence l n . This sequence gives the number of subtrees which can be left 
subtrees of a rooted p-tree and such that its total weight is n. The subtrees are 
required to be strict, that is, the main tree does not qualify as a left subtree. 

• Analogously the sequence r n is the sequence of possible right subtrees of weight n. 

• The sequence m4 for i = 1, . . . ,p— 1, gives the number of interior subtrees which 
start with a caret of type M 1 . Observe that this subtree is completely composed 
of middle carets, and also with total weight n. 

Observe that the subtrees are always considered as subtrees of the main tree, which means 
that, for instance, a left subtree never has carets of type 1Z because that would mean it 
is the total tree. A subtree which starts in an A4 l caret has all interior carets. 

Each one of these sequences will have its generating function: 

oo oo oo oo 

n=0 n=0 n=0 n=0 

Now we will establish relations between the sequences which will give functional equations 
for their generating functions, which then will allow us to find the growth of the submonoid 
of positive words. For instance, if one considers the tree representing a word, and assumes 
the tree has total weight n, since the root has weight zero, the weight has to be distributed 
among all the p children subtrees. Hence, a tree of total weight n will be obtained every 



time that we take a family of subtrees such that the sum of their separate weights as 
subtrees is n. 



This fact gives the first formula satisfied by the sequences, and also by the generating 
functions: 



(2) s n = l io m 



■ - mf 2 V.,- 



10 jl jp-2 ' 3p-l 

JoH hjp-i=n 

(3) S = LM 1 ■ ■ ■ M P _ 2 R. 

To find a formula for the function L(x) of left subtrees, one needs to consider that left 
carets have weight 1. Hence the different subtrees only have to add up to n — 1. The 
formula is 

; ; (!) (P-2) (p-1) 

ioH hjp-i=n-l 

(4) L — 1 — .rl.M.M-, ■ ■ ■ M p _i. 

The formula for the generating functions is obtained by multiplying each side of the 
formula for sequences by x n . The right hand side has an x multiplying because the 
indices are shifted by one. 

For the function for right trees, one has to take into account the fact that a right caret can 
be of type 7Z or TZm, with weights zero and two respectively. For the first possibility, the 
caret is of type TZ , and all its successors have no weight. Observe that in a positive word 
there can be one and only one caret of type TZ , because any others would be reducible. 
Hence, if the caret is of type TZ , all the weight is concentrated in its only predecessor. 
So there are as many right subtrees of this type as trees of the type M 13 ' 1 with the same 
weight, which gives the first part of the recurrence equal to mi P x \ 

If the right caret is of type TZm, it carries weight 2 and one the successors is necessarily 
nonempty with a middle caret somewhere. Hence if one of the successors is necessarily 
nonempty, the term in the recurrence has all possible weights for these successors. The 
formula is 

(p-i) i (p- 1 ) (!) (p-2) 

r n = m n + y, m ^ m)/---m\ „ r,- 



jo jl jp-2 3v- 



jo-\ h,jp_i=ri-2 

JlH hjp-l>l 



(5) R = Mp-i + x 2 (M 1 M 2 ■ ■ ■ M^R - M p _i). 

Finally, the middle subtrees are the ones whose children are also middle subtrees and hence 
facilitate the resolution of the equations. A middle caret of type M 1 has either weight 1 
if its successors are empty or weight 3 if one of the successor subtrees is nonempty. Both 
cases correspond to the two adding terms of the formula for the sequence: 

U) 00 (p-i) , (») (p-i) (i) (») 

m; ; = > m) • • • m: + > m)' ■ ■ ■ m) my ■ ■ ■ m) ' 

11 / J Ji Jp — 1 / J 30 Jp — i — 1 Jp — i Jp — 1 

ji-\ h?p-i=n-l jo+jiH hjp-i=n-3 

j P - t H hjp-i>l 

which gives the following formula for the generating functions: 

(6) Mi - 1 = ./■ .\ I j .\ I ; . i • • • M p _! + ./•"• UU . i • • • M p _ l (M 1 ■ ■ ■ M, , .U, - 1). 



Solving these equations will give us information on the function S(x), which is the one 
we are interested in, and the growth of positive elements in the groups F(p). 

4. Growth functions and growth rates of F + (p) 

Now we collect formulas (j3J), (JH), (JSJ), © to find the equation on S(x) and the radius of 
convergence of the corresponding series. First of all, we have to mention that F + (p) has a 
free submonoid generated by x , x\, . . . , and so the growth rate of F + (p) is at least 
p. As we will see at the end of this Section, the exact value of the growth rate is only 
slightly larger than p. (In fact, it is always less than p + 1/2.) 

Let 

M(x) = M 1 {x)M 2 {x) ■ ■ ■ M p _!(x). 
Lemma 4.1. For all < i < p — 1 , we have 

MlM2 ''' M -=(T^W + 1 -* 2 - 

Proof. We proceed by induction on i. If i = 0, then the result is obvious. Let 1 < % < p—1. 
Formula © can be written as 

Mi = l + + x 3 MiM - 



M. ■■■ M, i V .U; •••.!/, i 

Therefore, 

Mi • • • Mi = Mi • • • M;_i + xM + x 3 M ■ M x ■ ■ ■ M { - x 3 M 

and so 

M x ■ • • M(l - x 3 M) = (x- x 3 )M + M x ■ • • M_i. 
Using the inductive assumption, we have 

Mi ■ ■ ■ M(l-a; 3 M) = (x-x 3 )M+ ^ _ +l-x 2 = (1 _ +(l-x 2 )(l-x 3 M). 

Now the only thing left to do is to divide by 1 — x 3 M. □ 
Taking i — p — 1 gives us 

Corollary 4.2. The function M = M(x) satisfies 

x 2 M = — - - + x 2 - 1. 

(1 - x 3 M)p~ 1 

Now we express S(x) in terms of M(x). It follows from (jlj) and (0) that 

£ 1 R (1-^)M P -! 

1 - xM 1 - x 2 M 

Now, using (JHJ), we have 

^ _ LMi? _ (1 - x 2 )M 



M p _x (1 -xM){l -x 2 M)' 



The first author proved in [7j that the growth function S(x) of positive elements of F 
F(2) is rational (although M(x) is irrational). Now we have the following 



Theorem 4.3. The growth function S(x) of positive elements in F(p) is irrational pro- 
vided p > 3. 

Proof. Let N = (1 — x 3 M) _1 . From Corollary 14.21 we have 

1-N' 1 = x 3 M = xN?- 1 + x 3 - x. 
Hence N = N(x) satisfies the equation 

(8) xN p + (x 3 - x - 1)N + 1 = 0. 

Suppose that S(x) is rational. Then it follows from (J7J) that M(x) satisfies a quadratic 
equation with coefficients in the field Q(x) of rational functions. Since M = x~ 3 (l — N^ 1 ), 
the function N(x) also satisfies an equation of degree at most 2 over Q(x). This implies 
that the polynomial f(t) = xt p + (x 3 — x — l)t+l from Q(x) [t] is divisible by a polynomial 
of degree at most 2. Since p > 3, the polynomial f(t) is reducible over Q(x). A standard 
algebraic trick (using Gauss' lemma) implies that f(t) is a product of two polynomials 
from Z[x] [t] of degree less than p. Taking x — 1, we obtain that the polynomial t p — t + 1 
is reducible over Q. However, this contradicts a result from [20] • D 

Now we will find the growth rate of F + (p). To do that, we need to take the radius of 
convergence of the series for S(x) and take the reciprocal. Observe that from (j2J) we 
deduce m n < s n for all n > 0. This implies that 

(limsupm n ) _1 > (limsup s n ) _1 , 

n— >oo n— *oo 

that is, the radius of convergence of the series S(x) does not exceed the one for the series 
M(x). Let x > be a real number such that S(x) converges. Then M(x) also converges 
and formula (|IJ) holds. 

To find the radius of convergence of S(x), we need to find the smallest positive real number 
such that the denominator of the right hand side of is zero. Since M(x) is increasing 
and < x < 1, the smallest positive solution of the equation M(x) = x~ l will not exceed 
the smallest positive solution of the equation M(x) = x~ 2 . Therefore, we need to solve 
the equation M(x) = x~ x . Notice that M(x) increases and x' 1 decreases so we can just 
speak about a positive root of this equation. Using (|4.2jl . we get x = (1— x 2 )^^ 1 ^ +x 2 — 1, 
that is, we need to find the positive root of 

(9) (l-x 2 f- l (l + x-x 2 ) = l. 

The growth rate of F+(p) will thus be equal to x~ l . We already know that the growth 
rate of F + (p) is at least p, as it was mentioned in the beginning of this Section. Hence 
x < 1/p. 

Let us rewrite this equation in the following form: 

ln(l + x — x 2 ) 
p ~ 1= -ln(l-x 2 ) ' 

From the Taylor formula for ln(l +y), we deduce the inequality 

y - y 2 /2 < ln(l + y) < y - y 2 /2 + y 3 /3, 

where y > 0, and then we get ln(l + x — x 2 ) < x — 3x 2 /2 + 4x 3 /3 — 3x A /2 + x 5 — x 6 /3 < x — 
3x 2 /2+4x 3 /3. Since -ln(l-x 2 ) > x 2 , we have p-1 < x~ 1 -3/2+4x/3 < x" 1 -3/2+4/3p. 
So x~ l > p + 1/2 - A/3p = p + l/2 + o(l) as p -»• oo. 



1U 



Now we want to show that x 1 < p+ 1/2. We have ln(l + x — x 2 ) > x — x 2 — (x — x 2 ) 2 /2 = 
x - 3x 2 /2 + x 3 - x 4 /2 and - ln(l - x 2 ) = x 2 + x 4 /2 + x 6 /3 + • • • < x 2 + x 4 (l + x 2 + 



ar h )/2 = x 2 + x 4 /(2 - 2x 2 ) < x 2 + 2x 4 /3 because x < 1/p < 1/2. This gives 

p - 1 > (x - 3x 2 /2 + x 3 - x 4 /2)/(x 2 + 2x 4 /3) = (1 - 3x/2 + x 2 - x 3 /2)/(x + 2x 3 /3). 
Finally, 

1 - 3x/2 + x 2 - x 3 /2 1 9-2x + 3x 3 

v- 1 > '- — '— = ; — > 1/x - 3/2 

v x + 2x 3 /3 x 2(3 + 2x 2 ) 1 1 

since 3x 2 — 6x — 2 < on [0; 1]. This gives x~ l < p + 1/2, as desired. So we get the 
following result. 

Theorem 4.4. The growth rate of the monoid F + (p) of positive elements in the group 
F(p) generated by xq, X\, . . . , x p -\ is a number ( p , which is the root of equation 

(y 2 -iy-\y 2 + y-l)=y 2 P. 

This number has the form ( p = p + X p , where < X p < 1/2 for all p and X p — ► 1/2 as 
p — > oo. 

Indeed, we proved inequalities p + 1/2 — 4/3p < x _1 < p + 1/2, where x is the solution of 
0. The inequality x -1 > p obviously follows for p > 3; if p = 2, then it is known from 
[Zj that C2 > 2.24. 

The equation in the statement of Theorem 14.41 is equivalent to via the substitution 
y = 1/x. Notice that x and y are roots of polynomials of degree 2p — 1 with integer 
coefficients. Also let us mention without proof that X p is strictly increasing with respect 
to p. 

The number ( p gives a lower bound for the growth rate of the group F(p). However, this 
estimate can be essentially improved. 



5. New normal forms for elements of F{p) 



We are going to find two new normal forms for elements of F(p). They will be analogs of 
the normal forms constructed in ^H] for the case F = F(2). 

The first of these normal forms will involve the infinite set of generators E = { x« (i > 0) }. 
Consider the following rewriting system T = T(p) over the alphabet S 1 * 11 = SUE -1 (basic 
facts about rewriting systems can be found in |2l ITU]): 

(1) xfx" e -»• 1 (i > 0, e = ±1) 

(2) x)Xi -> x i x^ +p _ 1 (j > i, e = ±1) 

(3) x^^xr 1 -> x ~ x x) (j >i,e = ±1) 

Notice that for every rewriting rule of T, the left hand side and the right hand side are 
equal in F(p). 

It is easy to see that T is terminating, that is, for every word w, the process of applying 
rewriting rules to w always terminates. Indeed, T either decreases the length of a word or 
it preserves the length. In the second case, if we make a vector that consists of subscripts 
of a word, the rewriting rules will decrease this vector lexicographically. 

Since T is terminating, applying the rewriting rules to a word w gives us a word v that 
cannot be reduced (that is, no more rewriting rules can be applied to v). We say that v 



is an irreducible form of w. Now we are going to check that F is also confluent, that is, 
every word has a unique irreducible form. To do that, we apply the Diamond Lemma. In 
our case, this means that if we have rewriting rules of the form ah — > u, be — > v, where a, 
b, c, d are letters and u, v are words, then uc and av have a common descendant. There 
are only finitely many cases to check, and all of them are easy. We will show one of these 
cases, the rest is left to the reader. 

Let us take the rewriting rules — > xj 1 x £ k and — > XiXjh-u where k > j > 
i, e = ±1. We have: 



and 



rp 1 />> *~ rp ^ rp rp k rp rp 1 rp^ 



rp^~ rp . rp v rp . rp^ rp v rp , rp rp^~ 

• L, k+p-l- h i dj j+p-1 Uj i- h k+2p-2- L j-p-l u "i u 'j+p-l J 'k+p~l- 



So the words have a common descendant. 

Now we know that T is complete, that is, terminating and confluent. Therefore, each 
element of F(p) can be uniquely represented by an irreducible word. So we have proved 
the following 

Theorem 5.1. Each element g 6 F(p) can be uniquely represented as a word of the form 



N{g) = xllxf 2 ---x1™, 



where m > 0, S\, 62, ■ ■ ■ , e m = ±1, and for every 1 < k < m one of the following conditions 
holds: 

• i k < ik+i 

• ik= k+i and e k = e k+1 

• < i k - ik+i < P and e k+ i = -1. 

Indeed, the conditions listed in the statement exactly mean that the word N(g) is irre- 
ducible, that is, it has no subwords that are left hand sides of the rewriting rules of T. 
The set of these irreducible words over X^ 1 will be denoted by A/i n f ■ 

Notice that the set A/j n f has the following property: a word belongs to A/inf if and only if 
all its subwords of length 2 belong to A/inf. That is, the normal form of Theorem 15.11 is 
locally testable. 

Now we will construct another normal form for elements of F(p). Now all words will 
involve only the finite set of generators x^ 1 , xf 1 , . . . , x^\. Moreover, these normal forms 
will give a regular language closed under taking subwords. Notice that this gives a regular 
spanning tree in the Cayley graph of F(p) in the above generators. As in ^Sj for the case 
p — 2, this tree is not geodesic. 

It is possible to write down a new rewriting system in order to get the normal form we 
wish to construct. However, it will take too much effort to prove that the rewriting system 
ijs complete. We choose an approach that differs from 



Let j > 1. Then j can be uniquely expressed in the form j = r + d(p — 1), where 
1 < r < p — 1, d > 0. In this case Xj equals in F(p) to the word Xq d x r x^. For any 
word w over E , replace each letter of the form x^ [j > 1, e — ±1) by where 
j = r + d(p — 1), 1 < r < p — 1, d > and then freely reduce all subwords of the form 
x £ q Xq £ (e = ±1). We obtain a word in generators Xq l , xf 1 , . . . , x ^ denoted by w. 



Lemma 5.2. If w G A/inf? w no subwords of the following form: 



(1) a;fa;- e (0 < i < r - 1) 

(2) a^xga^ (£;>0 ; l</3<a;<r-l) 

(3) a^xg +1 x^ (A; > 0, 1 < (3 < a < r - 1) 

(4) x £ a x k Q +1 Xf3 (k > 0, 1 < a < (3 < r - 1) 

(5) x^x^Xp 1 (k > 0, 1 < a < 13 < r - 1) 

The words of the form 1) - 5) are called forbidden subwords. The set of words in 
{ Xq 1 , xf 1 , . . . , Xp\ } without forbidden subwords will be denoted by Afi n - 

Proof. Let w G A/j n f have the form 

on — rr^Orph J1J2 . . . kh-1 l h k h 

yiu) w — x x ai x x a2 x Q x ah x , 

where /i > 0, cti = r\ + g?;(j» — 1), 1 < ri < p — 1, di > 0, U ^ for all 1 < 2 < /i. By 
definition, 

/T T \ ,7. _ „feo-di„ii ™dl+fci-d2 d h _ 1 +k h _ 1 ~d h u d h +k h 

\ L1 J W — X X ri X Q X r2 ---X Q x r h x 

Suppose that w is not freely irreducible. Then there exist an % from 1 to h — 1 such that 
di + ki — d i+ i = 0, Tj = r i+1 , and and /j/j+i < 0. By definition, words from A/i n f have no 
subwords of the form x^ l x$ for j > and also have no subwords of the form x^Xq 1 for 
j > p. This implies ki < 0. 

Suppose that fcj < 0. Then di = (otherwise > p). Since <ij + i > 0, we obtain 
d-i + ki — di + i = ki — d i+ i < 0. This is a contradiction. Therefore, ki = and so di = d i+ i. 
This implies ttj = Tj + di{p — 1) = r i+1 + G? i+1 (j» — 1) = Thus the word w is not freely 
irreducible since kk + i < 0. We have a contradiction. This proves that w has no subwords 
of the form 1). 

Suppose that w has a subword of one of the forms 2) - 5). Let xf^XQ X+kl ~ dx+1 xf^ +i be such 
a subword, where 1 < % < h. As above, ki < 0. Suppose that ki ^ 0. This implies di = 
and di + ki — di + i < 0. But none of the words 2) - 5) can contain Xq 1 . This allows us to 
conclude that ki = and w contains v = xf^xf dx+1 xf^ +l as a subword. 

Suppose that v satisfies condition 2). This means that di > dj+i, rj > r i+ i, > 0. 
Hence u> contains x^x ai+1 , where cti = r\ + di{p — 1) > r i+ i + d i+ i{p — 1). So w does not 
belong to Amf, which is impossible. 

Suppose that v satisfies condition 3). Now di — d i+ i > 1, > r i+1 , < 0. This leads 
to di — a i+ i = (rj — r i+ i) + (rfj — d i+ i){p — 1) > p, which also contradicts u> G A/i n f- 

Suppose that f satisfies condition 4). Then di — d i+ i > 1, r\ < r i+1 , l i+1 > 0. Now 
n - r i+l > 1 - (p - 1) = 2 - p and so a { - a i+1 = (n - r i+1 ) + (di - d i+1 )(p - 1) > 
(p — 1) + (2 — p) > 0. Thus w contains x^x ai+1 with «j > aij + i. This cannot happen by 
definition of A/inf- 

Finally, suppose that v satisfies condition 5). Now we have di — d i+ i > 2 and so cti — a i+ i = 
(j'i — r i+i) + (di — d i+ i)(p — 1) > 2(p — 1) + (2 — p) = p. However, it should be «j — a i+ i < p 
because w G A/i n f- 

The proof is complete. □ 



For every g 6 F{p), we have the word N(g) € A/fi n that represents (7. We will prove that 
g is represented uniquely by a word from A/fin- This will follow from 

Lemma 5.3. Tfte mapping w 1— > w from Af^ to A/fi n a bijection. 

Proof. We prove first that the mapping w 1— > w from A/j n f to A/fi n is injective. As above, let 
u> G A/jnf have the form (fTUjl. Thus iu equals (fTT|). Suppose that we know the word w, that 
is, we know the numbers m = k — d%, vn\ = d% + ki — d 2 , ■ ■ ■ , rn h -\ = dh-i + — dh, 
m h = k h + dh. Our aim is to recover the numbers k , d\, ki, . . . , dh-i, fc/i-i, dfe, 

Let ft > 1. It follows from the definition of A/i n f that fc^ < 0. Moreover, either kh < 
and rffc = 0, or fc/j = 0. In the first case = kh + dh = kh < 0, in the second case 
m h = kh + dh = dh > 0. Since we know rrih, we can distinguish between these two cases. 
Namely, if rrih < 0, then dh = 0, kh = mh- If rrih > 0, then kh = 0, dh = mh. Now we 
know dh and kh- 

If ft > 2, then /c/ l _ 1 < 0. As above, we have one of the two cases: kh-\ < 0, dh~\ = 0, 
or kh-i = 0. The number kh-i + rf^-i is negative in the first case and nonnegative in 
the second case. But this number equals rrih-i + dh, so we know it and thus we are able 
to distinguish these cases. In the first case we have dh-i = 0, kh~\ = rrih-i + dh', in the 
second case — kh-x = 0, dh-i = m>h-i + dh- Therefore, we know dh-\ and kh-i- 

Continuing in this way, we get the values of dh-2, k h -2, ■ ■ ■ , d%, k\. At the final step we 
get k = m + d\. 

Now we show that the mapping is surjective. We start with a word from A/gn- This word 
has the form 

l ±z J x x ri x x r 2 x X 7-h X • 

Using the rules described in the first part of the proof, we define the numbers k , d\, k\, 
. . . , dh, kh- It follows that di > for all i from 1 to ft. So we can form a word w as in 
(fTTljl. where ati = ri + c?j(p — 1) (1 < i < ft). It is obvious that w equals the word (fT^jl . It 
remains to prove that w belongs to A/i n f- 

Let us assume the contrary. Since w has no subwords of the form XqXq 6 , it should contain 
one of the following subwords: 

a) x\x^ e (i>l,e = ±1); 



b) Xj Xi (j > i); 

c) xflp-tXi 1 (j > i). 

In case a), w will contain a forbidden subword of the form x e r x~ e . Notice that ki < for 
all 1 < % < ft; if fej < 0, then = 0. This means that in cases b) and c) one has % > 1. 
Let j = a + d(p — 1), i = (3 + d'{p — 1), where 1 < a, (3 < r — 1, d, d' > 0. Applying the 
"bar" mapping to b) and c), we see that the word w contains u = x^ 1 XQ~ d x@ in case b) 
and v = x^ l XQ~ d +l x~^ in case c). Since j > i, we have (d — d')(p — 1) > (3 — a > — (p— 1). 
Hence d — d! > 0. If u is not forbidden, then a < (3. But in this case d — d' > so u has 
to be forbidden anyway. If v is not forbidden, then we also have a < (3, which implies 
d — d' + 1 > 2. We have a final contradiction. 



The proof is complete. 



□ 



From Lemmas 15 . 21 and 15 . 31 we obtain 

Theorem 5.4. Each element g G F(p) can be uniquely represented by a word w G A/fin- 
This means that for every g G F(p) there is exactly one word over {xq,Xi, ■ ■ ■ -, x p-\ } 
that represents g and has no forbidden subwords. This gives a regular set of normal forms 
for the group F(p). 

Indeed, the set of forbidden subwords is a regular language. So the set A/fi n of words that 
do not contain forbidden subwords will be also regular. Throughout the rest of the paper, 
we will denote this language by C p . (For basic properties of regular languages see [2*T].) 

6. Lower bounds for the growth rates of F(p) 

A lower bound of (3 + y/E)/2 = 2.618 ... for the growth rate of F = F(2) was obtained by 
the second author in pH]- Now we will find a similar lower bound for each F(p). In the 
previous section, we constructed a regular language C p of normal forms for F{p). Each 
word of length n in £ p is at a distance at most n from the identity in the Cayley graph of 
F(p). So the growth function of L v does not exceed the number of elements in the ball of 
radius n for F(p). Then, finding the growth function and the growth rate of C p , we find 
a lower bound for the growth rate of the group F(p). 

An automaton to recognize the language L v has 3p + 1 states. However, it is easier to 
construct a directed graph with only 2p + 1 vertices (states). This graph will be denoted 
by A p and we will also call it an automaton although its edges have no labels. The 
description of A p is as follows. 

The vertices (states) of A p are denoted by q, q , g 1; . . . , q p -i, g^o, q2,o, • ■• j Qp-i,o, Q- They 
will correspond to the following partition of C p into disjoint subsets: 

• The set { 1 } that consists of the empty word (state q). 

• The set of words that end with x^ 1 and do not have a terminal segment of the 
form xf 1 Xq, where l<i<p — 1, k>l (state go). 

• The set of words that end with xf 1 (state qi for each 1 < % < p — 1). 

• The set of words that end with xf x XQ (state g^o for each 1 <i < p — 1). 

• The set of words that end with x i x x\ for some 1 < i < p — 1 and k > 2 (state g). 

Let w G C p . If w is empty, then wxf 1 will be in C p for all < i < p — 1. We draw two 
arrows from g to g^ for each < % < p — 1 . 

Let w correspond to the state go- Then w = vx e for some word v and for some e — ±1. 
The word wx e Q will be in C p ; for each 1 < i < p — 1 the word wxf 1 will be also in C p since 
w has no terminal segments of the form x^Xq (1 < i < p — 1, k > 1. Thus we draw an 
arrow from go to itself and two arrows from go to each qi (1 < i < p — 1). 

Let w correspond to q^ (1 < i < p— 1). The words wXq 1 and wxq belong to C p ; we draw 
an arrow from qi to go and an arrow from g^ to g^o- The words wxj belong to C p whenever 
i < j < P — 1; the words wxf 1 belong to C p for all 1 < j < p — 1. So we draw one arrow 
from qi to each q 1} . . . , g^ and two arrows from g^ to each g i+1 , . . . , g p _i. 

Let w correspond to g^o (1<«<P — 1)- The word wxq is in C p and we draw an arrow 
from g i to q. Also wxj 1 G C p whenever i < j < p — 1. So one arrow goes from g i to 
each qi, . . . , g p _i. No other arrows can appear. 



Finally, let w correspond to q. Now only wx leads to a word in L p \ it corresponds to an 
arrow from q to itself. 

The description of A p is complete. Notice that the number of words in C p of length n is 
exactly the number of (directed) paths of length n in A p starting at q. We would like to 
compute the number of paths in A p of length n starting at q and ending at a given state. 
For each state we consider the corresponding generating function. Namely, to each vertex 
v we assign a series of the form J2^Lo a n^ n ■, where a n is the number of paths in A p starting 
at q and ending at v. These generating functions will be denoted by /, /j (0 < i < p — 1), 
fi,o (1 < * < V — 1)) / for each of the states, respectively. We will write down a system of 
equations for these functions. 

First of all, it is clear that f(t) = 1. To find fo, we mention that two arrows go from q 
into q and one arrow goes into q from each of the states q , qi, . . . , q p -\. Hence 

(13) /o = t(2/ + /o + /i + ••• + /*-!). 

Given a vertex q% (1 < i < p — 1), we observe that two arrows go into qi from q, qo, . . . , 
and one arrow from q iy . . . , q p -\. Also one arrow goes into from each g 10 , . . . , q i>0 . 
Thus 

(14) ft = t(2f + 2/ + • • • + 2/i_ a + £ + •■■ + /p_ a ) + t(A,o + • ■ ■ + A,o). 

Notice that Ao = t/i for each 1 < i < p — 1 because only one arrow goes into g^o (from 
the state q^. Thus we can rewrite (JHJ) as follows: 

(15) fi = t(2f + 2f + ■ ■ ■ + 2/<_! + /i + • • • + / p _0 + t 2 (/! + ■•■ + /■). 

Finally, there is one arrow that goes into q from each of the states gi )0 , . . . , q P -i,o, q~- So 

(16) / = t(/i,o + • • • + /p-1,0 + /) = t 2 (h + ■■■ + f p -i) + tf. 

In order to solve the system, let us consider the difference of equation ()15|) with i — 1 and 
(UJ). This gives h-h = tf + t 2 /i, that is, (1 - t 2 )/i = (1 + f)/o- So / = (1 - t)fr. 

Now suppose that 1 < i < p— 1. If we take the difference between and /j using (JT^ 
we obtain f i+1 - /» = t/i + t 2 /i+i, which implies / f = (1 - 

Now for all < i < p — 1 one has fi — (1 — The equation (jlHjl becomes 

(i - rVp-i = 2t + t/ P _i((i - ty- 1 + ■ ■ ■ + (i - 1) + 1) = 2t + f p ^(i - (i - t) p ) 

This gives 

2t 



(17) /p-i(t) 



(i - t)p + (i - ty- 1 - r 



In order to find the number of words in C p having length n, we need to add all the 
generating functions for all states. The result will be 

%(t) = i + fo + h + ■ • ■ + fp-i + t{h + ■■■ + U-i) + ■ (A + • • • + f P -i) 

(here we used fITBj) to express /). Taking into account that 

A + " • • + fp-i = ^ • fo - 2 



1U 



from ([13)1 . we have 

M*) = i + /o + (i + 1 + (/! + ••• + f P -i) = i + (i + ~) Jo - T 

Now, using /o = (1 — £) p-1 /p-i and (fTTj) . we finally have 



l-t (l - t)p + (l - t)?- 1 - 1 ' 

This is the generating function for C p . Thus the growth rate of C p will be the reciprocal 
of t, where t is the smallest positive root of the denominator of the right hand side of 

(ED. 

The number y — (1 — t)^ 1 is the root of y p = y + 1. It is clear that y > 1. Let y — 1 + x, 
where x > 0. We would like to solve the equation (1 + x) p = 2 + x. Notice that < x < 1. 
Since (1 + x) p < 3, the root x approaches as p goes to infinity. 

The equation (1 + x) p = 2 + x can be written as 

ln(2 + x) ln2 + ln(l +x/2) ln2 + x/2 + o(x) 



P 



ln(l + x) ln(l + x) x — x 2 /2 + o(x 2 



Therefore, 

< 19 > 1^ = = + 'V + = *" + 5 + "« 

as p — ► oo. 

We are interested in the number £ p = where t is the root of (1 — t) p + (1 — i) p_1 — 1 = 0. 
Here (1 — t)^ 1 = y, where y = 1 + x is the root of y p = y + 1. It is easy to see that 
£ p — 1 + x -1 . So we deduce from (fT9")l that 

e P = ^f + ^ + o(i), p-,oo. 

It is also easy to see that £ p = t" 1 satisfies the following equation: (2£ — 1)(£ — l) p_1 = £ p . 
So we proved 

Theorem 6.1. The growth rate of the group F(p), p > 2 has a lower bound of £ p , where 
£ p satisfies the equation 

(2e-i)(e-ir 1 = e 

The following asymptotic formula holds: 

e P = ^ + ^ + o(i), p-oo. 

Here are several numerical values of £ p : 

£ 2 = 2.618033989 
£ 3 = 4.079595623 
£ 4 = 5.530132718 
£ 5 = 6.977144180 

and so on. For large values of p, the growth rate of F(p) is at least 0.72(2p — 1) (recall 
that 2p — 1 is the maximum value of the growth rate of a p-generated group; this happens 
if and only if the group is free of rank p) . 



It would be interesting to find nontrivial upper bounds for the growth rates of F(p). This 
means to find a constant c < 1 such that the growth rate of F(p) in its natural generators 
does not exceed c(2p — 1). 
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